DIONYSUS: Towards Query-aware Distributed Processing of RDF Graph Streams
نویسندگان
چکیده
Arguably, the most significant obstacle to handle the emerging application’s data deluge is to design a system that addresses the challenges for big data’s volume, velocity and variety. Work in RDF stream processing (RSP) systems partly addresses the challenge of variety by promoting the RDF model. However, challenges like volume, velocity are overlooked by existing approaches. These challenges demand optimised combination of scale-out and scaleup solutions. Furthermore, various other requirements for RSP systems, such as an efficient integration of distributed stream sources, storage of historical streams and their analysis, and integration of stateful operators to support complex event processing over streams are far from being addressed in an efficient way. Our vision is to design a general purpose RDF graph streaming system, which will be able to cope with distributed streams and shares local optimising strategies to allow different kinds of queries (analytical, streaming, sequence-based) through one query interface. The proposed system will offer a black-box solution that will allow analysts to tap in the goldmine of massive RDF graph streams. We consider the challenges and opportunities associated in designing such system, introduce our approaches to these topics, and discuss the components of our envisioned system.
منابع مشابه
Towards Efficient Semantically Enriched Complex Event Processing and Pattern Matching
Management and recognition of event patterns is becoming thoroughly ingrained in many application areas of Semantically enabled Complex Event Processing (SCEP). However, the reliance of state-of-the-art technologies on relational and RDF triple model without having the notion of time has severe limitations. This restricts the system to employ temporal reasoning at RDF level and use historical e...
متن کاملAn Adaptive Framework for RDF Stream Processing
In this paper, we propose a novel framework for RDF stream processing named PRSP. Within this framework, the evaluation of C-SPARQL queries on RDF streams can be reduced to the evaluation of SPARQL queries on RDF graphs. We prove that the reduction is sound and complete. With PRSP, we implement several engines to support C-SPARQL queries by employing current SPARQL query engines such as Jena, g...
متن کاملDistributed Stream Reasoning
Stream Reasoning is the combination of reasoning techniques with data streams. In this paper, we present our approach to enable rule-based reasoning on semantic data streams in a distributed manner. Data streams are being continually generated in diverse application domains such as traffic monitoring, smart buildings, and so on. Continuous processing of such data has been intensively investigat...
متن کاملProcessing SPARQL Queries Over Linked Data-A Distributed Graph-based Approach
We propose techniques for processing SPARQL queries over a large RDF graph in a distributed environment. We adopt a “partial evaluation and assembly” framework. Answering a SPARQL query Q is equivalent to finding subgraph matches of the query graph Q over RDF graph G. Based on properties of subgraph matching over a distributed graph, we introduce local partial match as partial answers in each f...
متن کاملRules and RDF Streams - A Position Paper
We propose a minor extension of the Graph Store Protocol and the SPARQL syntax that should be sufficient to enable the application of rules to some kinds of RDF Streams (as defined by the the RDF Stream Processing W3C Community Group) to generate new RDF streams. The IRI identifying an RDF stream is extended with a query string field-value pair defining a sequence of windows of the stream. The ...
متن کامل